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Abstract 

We deal with a general class of extreme- value regression models introduced by Barreto- 
Souza and Vasconcelos (2011). Our goal is to derive an adjusted likelihood ratio statistic 
that is approximately distributed as with a high degree of accuracy. Although the 
adjusted statistic requires more computational effort than its unadjusted counterpart, it is 
shown that the adjustment term has a simple compact form that can be easily implemented 
in standard statistical software. Further, we compare the finite sample performance of the 
three classical tests flikelihood ra tio, Wald, and score), the gradient test that has been 
recently proposed bv lTerrelll (j2002l ). and the adjusted likelihood ratio test obtained in this 
paper. Our simulations favor the latter. Applications of our results are presented. 
Key words: Extreme-value regression; Gradient test; Gumbel distribution; Likelihood 
ratio test; Nonlinear models; Score test; Small-sample adjustments; Wald test. 



1 Introduction 

The extreme-value distributions are frequently used to model extreme events, such as extreme 
floods and wind speed, and in survival or reliability analysis to model the logarithm of lifetime 
data. The literature on statistics of extremes has grown fast due to the increasing interest 
in statistically modeling extreme values (minimum or maximum) in a wide range of areas, 
such as climatology, hydrology, reliability, fina nce, insu r ance, and environmental sciences. A 
classical reference on the su b ject is the bo ok bv lGumbell (119581) . More recent references include 



Kotz and Nadaraiahl fl2000h . IColesI fl200lh . and 



Castillo et al. 



( 120051 ) ■ among others. 



In this paper, we deal with a general class of extreme- value regression models introduced 
by iBarreto-Souza and Vasconcellod (120111 ). The authors presented large-sample inference on 
the parameters, and also considered the issue of correcting the bias of the maximum likelihood 
estimators in small samples. The study of small-sample inference in extreme-value models 
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is relevant since the amount of extreme d a ta ava ilable for analysis may be small in practical 



applications and, as Mueller and RufibachI (120091) pointed out, there are very few articles that 



focus on small sample problems in extreme value theory. Here, we focus on statistical tests in the 
genera l class of extreme-value regression models proposed by 



Barreto-Souza and Vasconcellos 



(120111 ) when the sample size is small. Specifically, our goal is to derive Skovgaard's adjusted 
likelihood ratio statistics in this class of models. We show that the adjustment term has a 
simple compact form that can be easily implemented from standard statistical software. The 
adjusted statistic is approximately distributed as with a high degree of accuracy. Further, we 
compare the finite sample performance of the three classical t ests ( likelihoo d ratio, Wald, and 



score), the gradient test that has been recently proposed by iTerrelll ( l2002l ). and the adjusted 
likelihood ratio test obtained in this paper. 

Let yi, . . . ,ynhe independent random variables, where each yt, t = 1, . . . ,n, has an extreme- 
value distribution with parameters fit and (pt and density function 

/(l/;/^t,0i) = ^exp 1^-^-^-^^ exp |-exp 1^-^-^-^^ I , y e R, (1) 

where fi E M and > are the location and dispersion parameters, respectively. The mean 
and the variance of yt are E(|/t) = fit + £(pt and var{yt) = 0^7r^/6, respectively, where £ is the 
Euler constant; S ~ 0.5772. If y has an extreme-value distribution with parameters fi and </>, 
we write y ~ EVmaxifJ', 4>)- This distribution is also called Gumbel or type I extreme- value 
distribution. Here, we call it the maximum extreme-value distribution to contrast it with the 
minimum extreme- value distribution (see Section 3 below), which is also called Gumbel or type 
I extreme-value in the statistical literature. 

The maximum extreme-value regression model with dispersion covariates is defined by ([T]) 
and by two systematic components given by 

gi^^t) = Vt = Vi^t, 13) (2) 

and 

h{<f)t) = 6t = 6{zt,^), (3) 

where (3 = (/3i, . . . , f3ky and 7 = (71, ... , 7^)"^ are vectors of unknown regression parameters 
(/3 G M'' and 7 G M^, k + m < n) and Xt and zt are observations on covariates. Here, t]{-, •) 
and are continuously twice differentiable (possibly nonlinear) functions in the second 

argument. Finally, g{-) and h{-) are known strictly monotonic and twice differentiable link 
functions that map M and M'^ into M, respectively. Let X be the derivative matrix of rj = 
(?7i, . . . , rjn)"^ with respect to f3~^ . Analogously, let Z be the derivative matrix of 5 = {61, ... , 6n)~^ 
with respect to 7^. It is assumed that rank(X) = k and rank(Z) = m for all /3 and 7. 

The paper unfolds as follows. In Section [2l we derive Skovgaard's adjusted likelihood ratio 
statistic for testing hypothesis on the parameters of the model definide by ([I])- ([3]). In Section [3], 
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we extend our results to a general class of minimum extreme-value regression models. Section 
m is devoted to a simulation study to compare the performance of the three classical tests, the 
gradient test, and the adjusted likelihood ratio test. Our simulation results clearly favor the 
adjusted test proposed in this paper. In Section [5l we illustre the use of our results in two 
real data sets. Section [6] closes the paper with a discussion. Technical details are left for an 
appendix. 



2 Main results 



Let i{9) be the log-likelihood function of the model defined by (11])- (|3]) given the vector of 
observations y = {yi, . . . , yn)- We have 



t=i 



where 



yt - iJ-t 



exp 



yt- f^t 



with fit and defined so that ([2]) and 
can be written as 



Yt \ Yt 

hold. In matrix notation, the log-likelihood function 



= H^-3^-f]l, (4) 
where / = (ln0i, . . . ,ln0„)^, 3 = (31, . . . ,3„)^, and 3 = (exp(-3i), . . . , exp(-3„))^, with 3t = 
{yt — ^t) / <Pti and 1 is the n-dimensional column vector of ones. The score function, obtained by 
differentiating the log-likelihood function with respect to the unknown parameters, is denoted 
hy U = {Ui3{f3, 7)''', ?Xy(/3, 7)''')''', with I and J denoting the expected and observed information 
matrices. We have 

f/^(/3,7)=XT$-iT(l-3), (5) 
f/,(/3,7) = ^^$-'i/(3-23-l), (6) 



J = 


Jpi3 


J Pi 


1 = 




hi 










IlP 


A7 



with 



Jpp 
J Pi 



J, 



77 



Lpp 



'77 



X^^-^T[Z^-^ + {I - Z)STjTX - ^1^ {I - Z)T(!?-^ 
Jj^^ = X^^-^T[I - i + ZZ)H^-^Z, 
Z^<^-^h{{-X + 2Z - 2ZZ + Z'^Z)<^-^ 

+ {-! + Z - ZZ)QH^HZ + 1^{I-Z + ZZ)H^-^ 
X^^-'^T^X, 1,31 



X 
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where X is an n x n identity matrix, Z = diag{3i, . . . ,3„}, Z = diag{exp(— 31), . . . ,exp(— 3„)}, 
$ = diag{0i,...,04, T = diag{l/g'{fxi),...,l/g'{fxn)}, H = diag{l//i'(0i), . . . , 
S = dmg{g"{^i^),...,g"{^ir.)}, Q = diag{/i"(0i), . . . , X = d^r]/df3df3^ and Z = 
d'^6/d'yd'y^ are n x k x k and n x m x m arrays, respectively, and r(-) denotes the gamma 
function, with r'^^)(-) and r*^^-'(-) being its first and second derivatives, respectively. The other 
quantities before. 

Let 6 = {/3~^ , 'y'^ be the unknown parameter vector that indexes the extreme-value re- 
gression model ([I])-®. In what follows, u = {ui, . . . , z/^)^ represents the parameter of interest 
and ip = {ipi, . . . ,'ips)~^ is the nuisance parameter; note that r + s = k + m. We consider 
likelihood-based tests of the null hypothesis "Ho '■ ^ = ^0, where uq is a fixed r-vector. Clearly, 
such tests may be inverted to give confidence sets for z/. Further, let J^^ denote the s x s 
observed information matrix corresponding to ip. Similarly, A^^ denotes a matrix formed from 
the (r + s) X (r + s) matrix A by dropping the rows and columns that correspond to the in- 
terest parameter. Additionally, hat and tilde indicate evaluation at the unrestricted (6) and 
at the restricted (6) maximum likelihood estimator of 6 under T-Lq, respectively. For instance, 
/ = 1(e), 1 = 1(9), a nd J = j(e). 



Skovgaardl (1200 ll ) derived an adjusted likelihood ratio statistic given by 



w 



w-2\og C, 

where w = 2{i{6) — i{0)) is the likelihood ratio statistic, 

{\i\ \T\ |Ji^|}^/2 {u^T'^ij-'Ti-^uy/^ 



(7) 



c 



11/2 



m)] 



(8) 



|T| |{/T J/-iT}^^ 
and q and T come, respectively, from 

q = EeAU{ei) {i{e, 
and 

T = EeAuie,) u^ie)] (9) 

by inserting 6 for 6i and 6 for 6 after the expected values are computed. Note that q is an 
(r + s)-vector and T is an (r + s) x (r + s) matrix. Under "Hq, w is distributed as y3 with 
error of order while w* follo ws this distribution ^y i th hig h de gree of accuracy (Skoygaard . 
p. 7). Simulation results in lFerrari and Pinheird ( 120101 ) and lFerrari and CysneirosI (j2008f ) 
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suggest that tests that use w* are much less size distorted than those that are based on w. 

In order to obtain the adjusted likelihood ratio statistic (I7j) in the extreme- value regression 
model ([I])-©, one needs to obtain the score vector, the observed and expected information 
matrices, J and /, respectively, the vector q, and the matrix T. We obtained 



X^^~^fC{I- MD)1 
Z^$-^H{C{SI + ND) - 1}1 
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and 

__ " X^^-^fCMDT^-^X X^^-^fC{I + D{MD-M-CN)}M-^Z ' 
-Z^$-^HCNbf^-^X Z^$-^HC{SI + D{N + CP - ND)}M-^Z ' 

where C = diag{0ii/0i, . . . , D = diag{(/iii - . . . , - /i„)/0„}, £> = 

diag{exp(-(/iii - /ii)/0i), . . . ,exp(-(/ii„ - M = diag{r(l + 0ii/0i), . . . , 1(1 + 
^m/^n)}, iV = diag{rW(l + 0nM),...,r«(l + 0i„/</)„)}, P = diag{r(2)(l + . . . , 

r*^^^(l + (f)in/(f)n)}, and the other quantities are as given above. Details of the derivations oiq 
and T are given in the Appendix. 

3 Minimum extreme-value regression model 

Let Hi, . . . ,yn be independent random variables, where each yt, t = 1, . . . ,n, has a minimum 
extreme-value distribution with parameters fit and (pt and density function 

f{y; /^t, 0t) = ^ exp exp |- exp | , y e R, (10) 

where fi & M and > are the location and dispersion parameters, respectively. The mean 
and the variance of yt are E{yt) = A^t — ^(pt and Yax{yt) = 0^7r^/6, respectively. If y has 
a minimum extreme- value distribution we write y ~ EVmin{fJ',4')- A useful property of the 
minimum extreme- value distribution is as follows: 

y ~ EVminif^, 0) ^ -y ~ EVmaxi-fJ^, (H) 

The minimum extreme- value regression model with dispersion covariates is defined by (fTOj) and 
by the systematic components ([2]) and ([3]). 

From (iTTl) . it is easy to see that the minimum extreme- value regression model (JTOl) with 
systematic components ([2]) and ([3]) is equivalent to the (maximum) extreme-value regression 
model ([1]) for the response variables Vi = — yi,...,t>„ = — with systematic components 
g*{fit) = fJ't = Vt = ~9^^{v{^t, f^)) and h*{(f)t) = h{(j)t) = S{zt,'y). Hence, inference for the 
minimum extreme-value regression model (flOl) with systematic components ([2]) and ([3]) can be 
performed from the results derived in Section [2] by changing the signs of the observations on 
the response variable and using an identity link function for the location parameter with the 
modified predictor rj^. As a result, the adjusted likelihood ratio statistic derived in Section [2] 
can be easily computed for the minimum extreme-value regression model. 

4 Monte Carlo simulation results 

We now present Monte Carlo simulation results on the small sample behaviour of the likelihood 
ratio test (w), the Wald test (W), the score test (Sji), the gradient test (5*^), and the adjusted 
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likelihood ratio test (w*). The Wald, score, and gradient statistics are given hj W = {u — 
z/o)^ {r")-^ iu-Po), Sr = Uj Uy, and St = Ujiu-uo). Note that the gradient statistic is 
very simple to compute since it does not involve the information matrix, neither the observed 
one nor the expected one. 

The maximum likelihood estimation of the unknown parameters was performed using the 
quasi-Newton BFGS nonlinear optimization algorithm wi th analytica l deriv atives developed by 



Broyden, Fletcher, Goldfarf & Shanno (see, for instance, IPress et al. ( 



19921)') and imp lemented 



in the function MAXBFGS in the matrix language programming Ox (iDoorniki . |2009| ). 

All the size simulation results are based on 10,000 Monte Carlo replications and the nominal 
level of the tests are a = 10%, 5%, and 1%. We also present power simulation results. Since 
the different tests display different sizes when a distribution is used, we simulated 500,000 
samples to estimate the critical values of the tests that give exact size, i.e., size equal to the 
chosen nominal level. Our power simulation results are obtained using exact critical values. 

We consider model ([1]) with constant dispersion and location parameter given by 

/it = /3i + (32Xt2 + PzXtz + PiXti + /^sa^ts, 

wich we refer as 'model 1'. Three null hypotheses are considered, T-Lq : /32 = (r = 1), 
'Ho : /32 = /Sa = (r = 2), and T-Lq : /32 = /Ss = /34 = (r = 3), and these are to be tested against 
a two-sided alternative. For the first case, we set /3i = 1, = 0, /^s = 1, = 6, and (3^ = —3; 
for the second case, f3i = 1, /32 = (3^ = 0, (3^ = 6, and /Ss = —3; and for the third case, (3i = 1, 
(^2 = = Pa: = 0, and = —3. The value of was fixed at = O.lQ The covariate values 
were obtained as random draws from a W(— 0.5,0.5) distribution and the sample sizes are 15, 
20, 30, and 40. 

Table [1] presents the null rejection rates of the five tests. It can be noticed that the likelihood 
ratio and Wald tests are markedly liberal in small samples. For instance, for n = 15, r = 1, 
and a = 5%, the null rejection rates of these tests are 11.6% and 20.8%, respectively. The 
gradient test is liberal in many cases but not as much as the likelihood ratio and the Wald 
tests. The score test is less liberal and displays conservative behavior in some cases. The 
adjusted likelihood ratio test is clearly the least size distorted. For the case mentioned above, 
the null rejection rates of the score, gradient, and adjusted likelihood ratio tests are 5.8%, 9.0%, 
and 5.0%, respectively. 

Figure [1] shows the plots of the relative quantile discrepancies versus corresponding asymp- 
totic quantiles for r = 1, 2, 3, and n = 20, 30, 40. Relative quantile discrepancy is defined as the 
difference between exact (estimated by simulation) and asymptotic quantiles divided by the 
latter. The closer to zero the relative quantile discrepancy, the better is the approximation of 
the exact null distribution of the test statistic by the limiting distribution. The plots confirm 



^For linear extreme- value regression models with constant dispersion, the null distributions of the five statis- 
tics do not depend on (j). The proof is omitted to save space. 
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Table 1: Null rejection rates (%); model 1 









a 


= 10% 








a 


= 5% 








a 


= 1% 






r 


n 


w 


W 


Sr 


St 


w* 


w 


W 


Sr. 


St 


w* 


w 


W 


Sr 


St 


w* 




15 


19.3 


28.2 


11.5 


16.9 


10.4 


11.6 


20.8 


5.8 


9.0 


5.0 


4.0 


11.1 


1.1 


1.8 


1.0 


1 


20 


16.8 


22.1 


11.3 


15.0 


9.9 


10.0 


15.6 


5.4 


7.8 


4.8 


3.0 


7.0 


0.9 


1.5 


0.9 




30 


14.9 


18.9 


10.9 


13.8 


10.6 


8.5 


12.0 


5.3 


7.1 


5.4 


2.4 


4.6 


0.9 


1.4 


1.2 




40 


13.0 


15.7 


10.3 


12.1 


10.1 


7.0 


9.5 


4.9 


6.2 


5.1 


2.0 


3.5 


0.9 


1.3 


1.1 




15 


22.9 


36.5 


12.7 


16.4 


7.7 


14.3 


28.7 


6.2 


7.9 


3.5 


4.8 


17.1 


1.1 


0.8 


0.6 




z 


on 


io.o 


Zo.o 


i i.D 


1/11 


y.z 


11/1 


on n 
zu.y 


0.4 


7 


4.D 


o.o 


1 n 1^ 

iU.O 


u.y 


U.o 


U.o 




30 


16.0 


23.4 


10.8 


13.0 


10.2 


9.2 


15.7 


5.2 


6.6 


5.2 


2.3 


6.7 


0.8 


0.9 


0.9 




40 


13.9 


19.0 


10.2 


11.7 


9.8 


7.6 


12.0 


5.0 


6.1 


5.2 


1.9 


4.5 


0.7 


0.9 


0.9 




15 


23.4 


42.9 


9.1 


12.6 


6.8 


14.9 


34.2 


3.4 


4.8 


3.0 


5.1 


21.2 


0.2 


0.2 


0.5 


3 


20 


19.8 


33.8 


9.8 


12.3 


8.7 


12.1 


25.1 


4.4 


5.4 


4.4 


3.8 


14.0 


0.5 


0.4 


0.8 




30 


16.6 


26.4 


10.1 


12.1 


9.9 


9.5 


18.2 


4.7 


5.4 


4.9 


2.5 


8.6 


0.6 


0.6 


0.9 




40 


14.7 


21.4 


10.2 


11.3 


10.3 


8.3 


14.0 


4.6 


5.5 


5.2 


2.1 


5.8 


0.7 


0.7 


0.9 



the tendency of the likehhood ratio and the Wald tests of rejecting the null hypothesis with 
higher frequency than expected based on the nominal level. It is clear that the distribution of 
the adjusted likelihood ratio statistic (w*) closely agrees with the reference distribution. The 
effect of the proposed adjustment becomes evident. Notice that the best agreement between 
the exact distribution and its asymptotic counterpart is achieved by w* for all the sample sizes. 

We now focus on the power comparisons of the five tests for r = 1, n = 30, and a = 10%. 
The rejection rates were obtained under the alternative hypothesis Hi '■ P2 = ^ for different 
values of e through Monte Carlo simulation. Figure [2] gives the plots of the power function of 
the tests. Visual inspection shows that the curves are practicaly coincident, i.e., the five tests 
display similar powers. 

Now, consider model with systematic components for the location and scale parameters, 
respectively, given by 

fxt = /3i + (32Xt2 + /^aXis + /3^xt4, 

and 

ln(0t) =71+ 722:42 + 732^*3 + 74^M, 

wich we refer to as 'model 2'. 

We consider three different null hypotheses, T-Lq : (3^ = {r = 1) , TIq : (3^ = (3^ = {r = 2) , 
and Tio ■ (^2 = (^3 = (3^ = {r = 3) , and these are to be tested against two-sided alternatives. 
The values for the f3s are f3i = 1, (32 = 1, (3^ = 6, and (34 = 0; (3i = 1, (32 = 1, and (3z = (3^ = 0; 
and (3i = 1 and (32 = (3z = (3^ = 0, for the first, second, and third cases, respectively. Further, 
we set 7i = In(O.l) ~ —2.30, 72 = —2, 73 = —2, and 74 = 0.1. The covariate values were 
randomly drawn from a W(— 0.5, 0.5) distribution and the sample sizes are 40, 50, 60, and 70. 
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Figure 1: Relative quantile discrepancies, model 1. 



Table [2] gives the null rejection rates of the five tests and Figure [3] shows the plots of relative 
quantile discrepancies. We note that the results for model 2 show similarity with those for model 
1. The likelihood ratio and Wald tests are clearly oversized, i.e. its type I error probability is 
greater than the nominal level, and their distributions are much different from the asymptotic 

distribution if the sample is not large. Again, the best agreement between the true and 
asymptotic quantiles is reached by w*, the adjusted likelihood ratio statistic proposed in this 
paper. The score test presents good behavior but tends to be conservative when r > 1. The 
gradient test is liberal in many cases, but high order quantiles of the gradient statistic are close 
to the asymptotic quantiles. We also performed power simulation comparisons among the five 
tests. Overall, the tests are equally powerful when true critical values are used. 

We now consider model ([T]) with constant dispersion and a nonlinear specification for the 
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Figure 2: Power of the tests; model 1; r = 1, n = 30, a = 10%. 



location parameter given by 



/3o + f3ixti + x§, 



wich we refer to as 'model 3'. 

Here, X is an n x 3 matrix whose t-th row is {1, Xti,ln{xt2)x^2) , Z = X, T = X, H = X, 
S and Q are matrices of zeros, X = d'^rj/dpdp^ is such that d'^rjt/ 8/328^2 = In(xt2)^a;f2 
t = 1, . . . ,n and zero otherwise, and Z = d'^S/d'jd'-f'^ is an x m x m array of zeros. The 
bracket product of the Ix n vector 1^ (X — Z) r$^^ and the n x 3 x 3 array X is an 1 x 3 x 3 
array, i.e., a 3 x 3 matrix, whose {i,j)-th element is 




exp 



Vt - l^t 



if (i, j) = (3,3) and zero otherwise. 

The null hypothesis under test is T-Lq : /32 = (r = 1). We set = e^'"*^ ~l.l,/3o = l,/3i = l, 
and P2 = 0, and the covariate values were drawn from a W(0, 1) distribution. The sample sizes 
are 15, 20, 30, and 40. Tableland Figure H] show our simulation results. 

From Table [3], we note that the tests that use w and W are typically liberal while the null 
rejection of the other tests keeps their sizes closer to the nominal levels. The adjusted likelihood 
ratio test and the score test display better performance than the others. For example, for n = 15 
and a = 10%, the null rejection rates of the tests are 16.6% (likelihood ratio), 22.2% (Wald), 
13.4% (gradient), 10.2% (score), and 10.0% (adjusted likelihood ratio). 

Figure H] shows that the reference distribution is not a good approximation for the null 
distribution of w and W, but is close to the true null distribution of the score and the adjusted 
likelihood ratio statistics. The high order quantiles of the gradient statistic closely agree with 
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Table 2: Null rejection rates (%); model 2 









a 


= 10% 








a 


= 5% 








a 


= 1% 






r 


n 


w 


W 


Sr 


St 


w* 


w 


W 


Sr 


St 


w* 


w 


W 


Sr 


St 


w* 


1 


40 


17.5 


24.2 


10.8 


15.3 


11.1 


10.4 


16.8 


5.7 


8.4 


6.1 


3.4 


8.0 


1.0 


1.7 


1.6 




50 


15.9 


20.6 


10.8 


14.1 


10.1 


8.9 


13.4 


5.2 


7.2 


5.3 


2.3 


5.4 


0.8 


1.4 


1.1 




60 


15.9 


21.9 


10.6 


13.9 


10.7 


9.3 


15.3 


5.1 


7.0 


5.7 


2.4 


7.0 


0.8 


1.2 


1.2 




70 


14.8 


19.9 


10.0 


12.9 


10.5 


8.4 


13.1 


5.0 


6.6 


5.4 


2.0 


5.6 


0.7 


1.1 


1.2 


2 


40 


21.7 


36.9 


8.3 


13.7 


11.8 


13.7 


28.6 


3.7 


6.4 


6.6 


4.5 


16.6 


0.6 


0.9 


2.0 




50 


18.6 


29.4 


9.0 


13.5 


10.9 


11.0 


21.4 


3.9 


6.3 


5.7 


3.2 


11.1 


0.5 


0.9 


1.4 




60 


17.0 


26.9 


9.0 


12.8 


10.6 


9.7 


19.6 


3.9 


5.9 


5.4 


2.9 


9.9 


0.5 


0.9 


1.3 




70 


15.8 


24.5 


8.8 


12.4 


10.3 


9.0 


17.1 


4.0 


5.6 


5.2 


2.3 


8.0 


0.6 


0.9 


1.2 


3 


40 


23.1 


43.2 


7.9 


12.1 


11.9 


14.1 


34.6 


3.3 


5.1 


6.8 


4.7 


21.7 


0.3 


0.6 


2.1 




50 


19.3 


34.0 


8.1 


12.1 


10.9 


11.5 


25.9 


3.5 


5.2 


5.6 


3.4 


14.4 


0.4 


0.7 


1.4 




60 


17.2 


30.4 


8.1 


11.4 


10.1 


9.9 


22.3 


3.6 


5.4 


5.2 


2.8 


11.6 


0.5 


0.8 


1.3 




70 


16.4 


27.3 


8.3 


11.4 


10.2 


9.4 


19.8 


3.6 


5.0 


5.0 


2.3 


9.2 


0.6 


0.7 


1.1 



Table 3: Null rejection rates (%); model 3 
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1.0 
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Figure 3: Relative quantile discrepancy; model 2. 
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Figure 4: Relative quantile discrepancy, model 3. 

the corresponding quantiles of the reference distribution. The five tests have similar power 
performance (results not shown). 



5 Applications 

In this section, we illustrate applications of our results in two real datasets. First, we deal with 
a dataset presented in Faivre & Masle (1988) and in Huet et al. (2004). The aim is to study the 
growth of winter wheat, by measuring the differences in dry weights of wheat tillers and stems. 
The explanatory variable x, measured on a cumulative degree-days scale, is an integral in time 
of all temperatures at which the wheat is submitted that are above the smallest temperature at 
which wheat can develop. Temperatures are measured in degrees Celsius and time is measured 
in days, the initial time being determined by the physiological state of the wheat. Plants 
growing on n = 18 randomly chosen small areas of about 0.15 are harvested each week and 



the dry weights of the tillers for plants harvested 
A detailed description of the data can be found in 



rom each area ar e measured in milligrams. 



Huet et al. 



(|200J, p. 61). 



Barreto-Souza and Vasconcellod (120111 ) assumed that the dry weight of tillers {yi 



Vis) 



is independent and follows a nonlinear extreme-value regression model ([T]) with 
fit = /3o + e^'+^'''' and In 0^ = 71 x*, t = 1, . . . , 18. 



They focused on the issue of correcting the bias of the maximum likelihood estimates (MLEs) 
of the parameters. The authors constructed confidence intervals based on the asymptotic 
normality of the MLEs and of the bias-corrected MLEs. Their simulation study suggested 
that the asymptotic confidence intervals centered at the bias-corrected estimators produce 
coverage probability closer to the nominal confidence coefficient than those centered in the 
uncorrected MLEs. However, the choice of the estimator (uncorrected or corrected MLE) does 
not change the approximation error between the true coverage probabilty and the nominal 
confidence coefficient. In fact, the correction that they derived only guarantees that the bias of 
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the corrected estimators are of order 0(n~^), but does not change the convergence rate of the 
distribution of the estimators to the normal distribution. 

Here, we illustrate the use of the five test statistics, namely the likelihood ratio, Wald, 
score, gradient, and adjusted statistic derived in this paper, to obtain interval estimates for 
the parameters. By choosing the 5% nominal level, the approximate confidence coefficient is 
95%. We emphasize that the confidence intervals obtained from the adjusted likelihood ratio 
statistic have coverage probabilities that are approximatel y equal to 9 5 % wit h a high degree of 
accuracy. This is guaranteed by the theoretical results in ISkovgaardl (l200ll ) and is confirmed 
in our simulation study. 

The 95% confidence intervals obtained by inverting the five tests are presented in Table S] 
and in Figure O It can be seen that the intervals obtained from the likelihood ratio and the 
Wald tests tend to be shorter than those obtained from the other tests as expected, since the 
first two are the most liberal. We emphasize that theoretical and empirical findings indicate 
that the confidence intervals constructed from the adjusted likehhood ratio statistic should be 
regarded as the most accurate. 



Table 4: 95% confidence intervals 





w 


W 


5fl 


St 


w* 




(47.6; 104.7) 


(52.7; 110.5) 


(31.4; 100.5) 


(38.6 ; 103.7) 


(39.7; 104.9) 


Pi 


(-5.084; -1.111) 


(-4.558; -0.922) 


(-7.604; -1.200) 


(-6.035; -1.113) 


(-5.486; -0.956) 




(0.0117; 0.0175) 


(0.0114; 0.0167) 


(0.0118; 0.0213) 


(0.0117; 0.0189) 


(0.0115; 0.0181) 


71 


(0.00610; 0.00723) 


(0.00603; 0.00720) 


(0.00619; 0.00740) 


(0.00612; 0.00727) 


(0.00622; 0.00751) 



Our second application d eals with a dataset consisting of 34 men's decathlon performance 



ataset consistmg oi d4 
Jl996l . p. 304) @ We 



at the 1988 Olympic Games iHand et al.l (119961 . p. 304) o We assume that the score in high 



jump follows an extreme-value regression model ([T]) with constant dispersion and systematic 
component for the location parameter given by 

for t = 1,...,34. The covariates are the scores in the following events: javelin throw (xi), 
long jump (X2), discus throw (xs), shot put (X4), and pole vault (X5). The statistics and the 
corresponding p- values for testing T-Lq : /3i = against Hi : /3i 7^ are presented in Table [5l 

Note that the p-values vary from 0.0168 (Wald test) to 0.1038 (adjusted likelihood ratio 
test). Only the likelihood ratio and Wald tests reject the null hypothesis at the 5% nominal 
level, but this conclusion may be misleading since these tests showed liberal behaviour in our 
simulations. The most reliable test, namely, the adjusted likelihood ratio test, does not suggest 
that the null hypothesis should be rejected (p- value = 0.1038). 



The dataset is also available at http : //www. stat .ncsu.edu/working_groups/sas/ sicl/data/olympic . dat 
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(3.973) 
(3.636) 
(6.404) 
(4.922) 
(4,530) 



-r 



-r 



-r 



^ W ■ ■ • ■ SR - - ST w* 



(0.0058) 
(0.0053) 
(0.0095) 
(0.0072) 
(0.0066) 



P2 



w W ■■■■ SR-- ST w* 



0.0070 
Y1 



(0.00113) 
(0.00117) 
(0.00121) 
(0.00115) 
(0.00129) 



Figure 5: 95% confidence intervals; length in parentheses. 

Table 5: Test statistics and tlie corresijoiiding /;- values 





((' 


ir 


Sb 


S-r 


(('* 


statistic 
p- value 


4.0407 
0.0444 


5.7161 
0.0168 


2.8208 
0.0930 


3.6293 
0.0568 


2.6466 
0.1038 



6 Conclusion 

In this paper, wc derived an adjusted version of the likelihood ratio statistic that provides 
accurate inference in extreme-value regression models in small- to moderate-sized samples. 
Our simulation results suggest that the likelihood ratio and the Wald tests can be markedly 
oversized in small- and moderate-sized samples. The gradient test can be oversized, but much 
less than the other two tests. The score test is even less size distorted and can be conservative 
in some cases. We emphasize that the score test performs clearly better than the likelihood 
ratio, Wald and gradient tests, and is competitive with the adjusted hkehhood ratio test in most 
cases. The adjusted likelihood ratio test obtained in this paper performs better than all the 
others. Although it requires more computational effort, is the least size distorted in most cases 
and it is, therefore, recommended for practical applications. We emphasize that our simulations 
were carried out in extreme-value regression models with linear and non-linear predictors for 
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both location and dispersion parameters. All simulation results exhibited reasonably similar 
behavior. 
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Appendix 



Let y ~ EVmaxin, 4>) and z = {y - EVmaxiO, 1). We have 

/oo 
z" exp(— cz) exp(— 2; — ex.p{—z))d. 

Using the transformation v = exp(— z), we have 



z, n = 0, 1, 



E(z" exp(-cz)) 



— lnv)^v'^v exp(—v) — dv 

V 

-irr(")(l + c), n = l,2,.. 



-l)"(lnz;f ?;^exp(-z;)d';; 



see (4.358) in ICradshtevn et al.l hood ). Since T^'^\ 



[n 



-(n — l)!(l/n + £ — X]fc=i obtain 

E{z) = £, E(z2) = r(2)(l), E(exp(-cz)) =r(l + c), E(exp(-(1 + c)z)) = r(2 + c), 
E(zexp(-z)) = £■ - 1, E(zexp(-2z)) = 2£' - 3, E(zexp(-cz)) = -r(i)(l + c), 
E(zexp(-(l + c)z)) = -r(i)(2 + c), E{z^exp{-z)) = r(2)(2), 
E(z2exp(-cz)) = r(2)(l + c), E(z2exp(-(1 + c)z)) = r'-^\2 + c). 

Let e = (exp(-(,/.nM)3i),...,exp(-(0i„/0„)3„))T. Since r«(n) = r(n - 1) + (n- l)r«(n- 1) 
and r(2)(n) = 2r(i)(n - 1) + (n - l)r(2)(n - 1) we can write after some calculations that 

E.(3) = l, E^(3) = ^l, EUZ) = £I, EUZ)=I, £^(^3) = - 1)1, 

EUZZ) = {£-1)I, EUZ^Z)=r(^\2)I, EUe) = Ml, E^(3"f ) = 11^ + Z, 

E^(3eT) = ll^M + CM, EJ^'') = Sll" - I, E^if) = E^li), 

EUde^) = £ll^M -£M -N, E^{u^) = £^1^ - £H + T^^\l)I, E^Ze) = -Nl, 

E^(3e^Z) = -11^ N -M - CN, E^(3e^Z) = -£11^ N + £N + P, 

E^(Z3f ) = {£- 1)11^ + {£- 2)1, 

E^(Z3e^) = {£- 1)11^ M - {£ - 1)M -M - {I + C)N, 

EUZii") = £{£ - 1)11^ - £{£ - i)2: + r(2)(2)i, 

E^iZje^Z) = -{£ - l)l\^N +{£-l)N + 2N + P + CP. 
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Now, let yt ~ EVmaxil^it, 4>it), lit = {Vt - l^it)/4>it ~ EV„^ax{0, 1) and 3t = (yt - Ht)/4>t- We can write 
3i = {4>it/(t>t)lit + (/Uit - P-t)/(t>t- Therefore, 

3 = C31 + Dl (12) 

and 

l = bei, (13) 

where ei = (exp(— ((/)ii/(/)i)3ii), . . . , exp(— ((/)i„/0„)3i„))~'' and the other quantities are as given earlier. 
The vector q given in ^ is given by 

^ r E^jC/;3(a;i)£(a;i)] -E^jC/^(a;i)%)] " 
[ JC/^ (t^i )£(^i )] - JC/^ (^1 )£(^i )] _ ■ 

Prom ([1]), (l5|), and the expected values obtained above, we have 

E^,[C/;3(u;i)^(u.i)] = E^,{xTci>-iri(l-3i)[-r-3T-3T]l} 

= Xl^^^Ti{-ll^ - 1E^,(3T) - 1E^,(3T) + E^^di)^ + E^^di^T) + ^u.,{lifi)}l 
= Xl^^^Ti{-lf - fll^ - 11^ + ir + (fll^ -2:) + (11^ +Z)}1 = 01. 

Now, from (jl2|) and (|13p . we have 

E^J[//3(^^i)£(^^)] = E^jx7«>riri(l-3i)[-r-3^-f]l} 

= Xl$ririE^J(l - 31) [-/T - - 1^1) _ eT^]l} 

= xT^^iTil-ir - 1E^,(3T)C - 11^ D - lE,,(eT)^ + E^.di)^ + E^^dia^C + 
E^, (31)1^1? + E^,(3ieT)^}l 

= ^Ti{-ir - fii^c - - i{-i^M)b + ir + (fii^ - x)c + 

11^ L> + (ll^M + CM)b}l = Xl^^^TiC{Mb - 1)1. 

Hence, 

Prom dH, dl]), and the results above, we have 

E^J[/^(a;i)/(wi)] = E^jZ|$rii7i(3i-Zi3i-l)[-r-3T-3T]l} 

= Z|^>ri/7i{-E^,(3i)r -E^,(3i3T) -E^,(3i3T) +E^,(Zi3i)r 

+Eo,,(^i3i3T) + ^uAZihfi) + 1^^ + lEc.i(3T) + 1E^,(3T)}1 

= z7^'5-^iJi{-r(2)(i)z + 2£'z + r(2)(2)z-z}i 
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and from (fT2|) and (fT3]) we get 

E^,[[/^(a;i)/M] = E^AZl^i^Hi{ii-2ih--\-)[-i' -i -f]l} 

= E^jZ|$r'^i(3i - ^131 - -alC - I'D - e[b]l} 

= Zl^^^Hi{-E^,{u)f - E^,(3i3T)C7 - E^.iu)!" D - E^,{iie[)D 

+E<,,(Zi3i)r + E^,{Ziuii)C + E^.iZiun^D + E^,{Ziue[)D 

+lf + lE^Ai[)C + 11^ D + lE^,(el)D}l 
= -Zj<^^^HiC{£I + Nb}l. 

It follows that 

E^,[Uy{u;i)liuJi)]-E^^[U^{uJi)l{io)] = Zj^^^ Hi{C{Sl + N D) - 1)1. 

Hence, 

_ r Xl^:[^TiC{I - Mb)l 
^~ Zj^:^^Hi{C{£Z + Nb)-Z]l 

The matrix T given in ([9|) can be written as 

[ E^J[/^(a;i)C/J(^)] E^J?7^(l^i)C/;H] 
Prom ([5]), (fT3|) . and the expected values obtained in the begining of this Appendix, we have 

E^,{Up{uJi)Ul{uj)} = E^,{Xl^^^Ti{l-ii)[X'^-^T{l-bei)f} 

= Xl^^^Ti{ll^ - lE^,{e[)b-E^,{ii)l' +E^,{iie[)b)T^-^X 
= ^Ti (11^ - ll^MZ) - ll"^ + (ll^M + CM)b)T<^-^X 

= Xl^-[^TiCMbT^-^X. 

The other blocks of T are derived in a similar fashion. We obtained 

E^,{Up{iOi)Uj^iu;)} = Xl^:^^TiC{Z + b{MD - M - CN)}H^-^Z, 

E^,{U^{u:i)Ul{u)] = -Zj^^^HiCNbT^'^X, 
E^AU^{oji)Ul^{oj)} = Zl^^^HiC{EX + b{N + CP-ND)}H^-^Z. 

Therefore, 

^ _ " Xj^^^TiCMbT^-^X ^riC{X + b{MD -M - CN)}H^-^Z ' 

-Zj^]^^HiCNbT^-^X Zj<^>^^HiC{£I+ b{N + CP - ND)}H<^-^Z 

The vector q and the matrix T are obtained from q and T given above by replacing Xi, <I>i, Ti, Zi, and 
Hi by X, T, Z, and H, respectively, and X, $, T, Z, and by X, T, Z, and H, respectively. 
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